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ABSTRACT: 

According to a first aspect of the present invention, a method for linking bytecodes of an 
uninterrupted block of bytecodes in the formation of a data flow graph comprises the steps of 
scanning the uninterrupted block of bytecodes in a forward manner to identify the start of each of 
the bytecodes, scanning in a backward manner bytecodewise each of the bytecodes in the 
uninterrupted block of bytecodes, and generating a link in the data flow graph that links each of 
the bytecodes to all other of the bytecodes used by the each of the bytecodes. According to a 
second aspect of the present invention, a method for linking bytecodes between uninterrupted 
blocks of bytecodes in the formation of a data flow graph, the uninterrupted blocks of bytecodes 
having links according to an order of execution of the unintemjpted blocks and wherein a stack 
state has been generated for each of the uninterrupted blocks of bytecodes, comprises the steps 
of stepping through a first path of a plurality of paths of the order of execution that terminates in a 
join to generate a link in the data flow graph between each bytecode producing a value in one of 
the uninterrupted blocks and each bytecode consuming the value in another of the uninterrupted 
blocks in the first path, and duplicating each link in the first path with a link for each bytecode in 
all of the plurality of paths other than the first path for each bytecode producing a value having a 
similar stack location to each bytecode producing a value in one of the uninterrupted blocks in 
the first path. 
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(54) Method for generating a Java bytecode data flow graph 



(57) According to a first aspect of the present inven- 
tion, a method for linking bytecodes of an uninterrupted 
block of bytecodes in the formation of a data flow graph 
comprises the steps of scanning the unintenupted block 
of bytecodes in a forward manner to identify the start of 
each of the bytecodes, scanning in a backward manner 
bytecodewise each of the bytecodes in the uninterrupt- 
ed block of bytecodes, and generating a link in the data 
flow graph that links each of the bytecodes to all other 
of the bytecodes used by the each of the bytecodes. 

According to a second aspect of the present inven- 
tion, a method for linking bytecodes between uninter- 
rupted blocks of bytecodes in the formation of a data 
flow graph, the uninterrupted blocks of bytecodes hav- 



ing links according to an order of execution of the unin- 
terrupted blocks and wherein a stack state has been 
generated for each of the uninterrupted blocks of byte- 
codes, comprises the steps o|jtepping through a first 
path of a plurality of paths of the order of execution that 
temninates in a join to generate a link in the data flow 
graph between each bytecode producing a value in one 
of the uninterrupted blocks and each bytecode consum- 
ing the value in another of the uninterrupted blocks in 
the first path, and duplicating each link in the first path 
with a link for each bytecode in all of the plurality of paths 
other than the first path for each bytecode producing a 
value having a similar stack location to each bytecode 
producing a value in one of the uninterrupted blocks in 
the first path. 
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D scription 

[0001] The present invention relates to the optimiza- 
tion of Java class files. More particularly, the present in- 
vention relates to the generation of aspects of a data 
flow graph fronn a Java classfile optimizer that Identifies 
the propagation of data values in blocks of bytecodeand 
between blocks of bytecode. 

[0002] A known problem for software devebpers and 
computer users is the lack of portability of software 
across operating system platforms. Attempts to address 
this problem must include means of ensuring security 
as computers are linked together in ever expanding net- 
works, such as the World Wide Web. As a response to 
both of these concerns, the JAVA programming lan- 
guage was developed at Sun Microsystems as a plat- 
form independent, object oriented computer language 
designed to include several layers of security protection. 

[0003] Java achieves Its operating system independ- 
ence by being both a compiled and interpreted lan- 
guage. First, Java source code, which consists of Java 
classfiles, is compiled into a generic intermediate fonnat 
called Java bytecode. Java's bytecodes consist of a se- 
quence of single byte opcodes, each of which identify a 
particular operation to be carried out. Additionally, some 
of the opcodes have parameters. For example, opcode 
number 21 , iload <vamum>, takes the single-word inte- 
ger value stored in the local variable, varnum, and push- 
es it onto a stack. 

[0004] Next, the bytecodes are interpreted by a Java 
Virtual Machine (JVM) which translates the bytecodes 
into native machine code. The JVM is a stacked-based 
Implementation of a "virtual" processor that shares 
many characteristics with physical microprocessors. 
The bytecodes executed by the JVM are essentially a 
machine instruction set, and as will be appreciated by 
those of ordinary skill In the art, are similar to the as- 
sembly language of a computing machine. Accordingly, 
every hardware platform or operating system may have 
a unique implementationof the JVM, called a Java Runt- 
ime System, to route the universal bytecode calls to the 
underlying native system. 

[0005] When developing the JAVA bytecode instruc- 
tion set. the designers sought to ensure that it was sim- 
ple enough for hardware optimization and also included 
verjfication means to provide security protection and to 
prevent the execution errors or system crashes that can 
result from improperly formatted bytecode. As Java's 
bytecodes contain significant type information, the ver- 
ification means are able to do extensive type checking 
when the bytecodes are first retrieved from the internet 
or a local disk. As a result, the interpreter of the native 
machine need only perform minimal type checking at 
run time. Unlike languages such as SmallTalk that pro- 
vide protection by performing extensive runtime checks, 
Java executes more quickly at run time. 
[0006] Although Java provides security through veri- 



fication means and portability through bytecodes, Java 
programs lag natively compiled programs, written in lan- 
guages like C/C++, In their execution time. When a user 
activates a Java program on a Web Page, the user must 

s wail not only for the program to download but also to be 
interpreted. To Improve Java's execution time, optimiza- 
tions can be introduced into the processing of Java byte- 
codes. These optimizations can be implemented in a va- 
riety of noanners including as Stand-Alone Optimizers 

10 (SAO) or as part of Justnn-Time (JIT) compilers. 

[0007] A SAO transforms an input classfile containing 
bytecode into an output classfile containing bytecodes 
that more efficiently perform the same operations. A JIT 
transforms an input classfile containing byte code into 

15 an executable program. Prbr to the development of 
JITs, a JVM would step through all the bytecode instruc- 
tions in a program and mechanically perform the native 
code calls. With a JIT compiler, however, the JVM first 
makes a call to the JIT which compiles the Instructions 

20 into native code that is then run directly on the native 
operating system. The JIT compiler pennits natively 
complied code to run faster and makes it so that the 
code only needs to be compiled once. Further, JIT com- 
pilers offer a stage at which the executable code can be 

25 optimized. 

[0008] To either optimize or compile bytecodes in- 
volves the translatbn of the source bytecodes into what 
is known in the art as an intermediate representation 
(IR). The IR provides information about two essential 

30 components of a program: the?eontrol flow graph (CFG) 
and the data flow graph (DFG). Subsequently, the IR is 
transformed for compilers into object code and for opti- 
mizers into an improved version of the source code for- 
mat. 

35 [0009] The CFG breaks the code into blocks of byte- 
code that are always performed as an uninterrupted 
group and establishes the connections that link the 
blocks together The DFG maps the connections be- 
tween where values are produced and where they are 

40 used. This includes connections within blocks of byte- 
codes and also connections between blocks of byte- 
codes. Because Java programs frequently use stack lo- 
cations as implicit variables, tracking the propagation of 
values is a difficult process. When extracting the data 

45 flow information to generate the DFG for the I R. the prior 
art approach has been to perform a complete simulation 
of both the stack and iterations through all possible flow 
paths. 

[0010] In contrast to traditional compilers where end 
50 users only interact with compiled programs, the run time 
nature of Java creates a significant incentive to optimize 
the speed and efficiency of Java optimizers and JIT 
compilers. Accordingly, it is an object of the present in- 
vention to provide a method for generating a DFG from 
55 the source bytecodes to form an IR that optimizes the 
speed and efficiency of Java optimizers and JIT compil- 
ers. 

[0011] The present invention is directed to the oplimi- 
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zation of Java class files by improving the efficiency of 
generating a DFG for the IR. 

[001 2] According to a first aspect of the present inven- 
tion, a forward scan of an uninterrupted bytecode block 
from a bytecode source file is performed to identify the 
start of each bytecode in the uninterrupted block. Next» 
a backwards scan Is performed. For each bytecode in 
the block, a DFG node is created, links to all the values 
used by the bytecode in the block are established, and 
links to preceding or subsequent nodes in the file are 
created. 

[0013] According to a second aspect of the invention, 
the bytecode blocks are linked in the file's order of exe- 
cution and a stack state is generated for each block of 
code. For each join statement generated by the CFG, 
one path of the CFG is stepped through. Next, links are 
established in the DFG between locations where values 
are used and where the values are produced. Finally, 
for all blocks parallel to the blocks producing values the 
links for bytecodes occupying the same stack state lo- 
cation are duplicated. 

[0014] The present invention will be described below 
with reference to exemplary embodiments and the ac- 
companying drawings, in which: 
[0015] FIG. 1 A is a flow diagram of the optimization 
of a JAVA bytecode source file using a stand alone op- 
timizer according to the present invention. 
[0016] FIG. 1B is a flow diagram of the optimization 
of JAVA bytecode source file using a just in time compiler 
according to the present invention. 
[0017] FIG. 2 is a flow diagram illustrating the gener- 
ation of control flow graph from a bytecode source file 
suitable for use in the present invention. 
[0018] FIG. 3 is a flow diagram of the generation of 
data flow graph according to the prior art. 
[0019] FIG. 4 is a flow diagram of a first aspect of the 
generation of a data flow graph for the links in a block 
of bytecode according to the present invention. 
[0020] FIG. 5A is a block diagram for the generation 
of a node in a data flow graph for the instruction I ADD 
according to the first aspect of the present invention. 
[0021] FIG. 5B is a btock diagram for the generation 
of several nodes in a data flow graph for a plurality of 
instructions according to the first aspect of the present 
inventbn. 

[0022] FIG. 6 is a flow diagram of a control flow graph 
suitable for illustrating a second aspect of the generation 
of a"clata flow graph for the links between blocks of byte- 
code according to the present invention. 
[0023] FIG. 7A is a block diagram of the steps in the 
method according to the second aspect of the present 
invention. 

[0024] FIG. 7B is a block diagram for the generation 
of nodes in a data flow graph according to the second 
aspect of the present invention. 
[0025] Those of ordinary skill in the art will realize that 
the following description of the present invention is illus- 
trative only and is not intended to be in any way limiting. 



Other embodiments of the invention will readily suggest 
themselves to such skilled persons from an examination 
of the within disclosure. 

[0026] Refemng first to FIGS. 1 A and IB, simplified 

5 flow diagrams 10 and 20 of the optimization of a Java 
bytecode source file are illustrated. In both flow dia- 
grams 10 and 20, Java source code file 12 is compiled 
by a Java compiler 1 4 into Java bytecode classfiles 16. 
In flow diagram 10. the classfiles are operated on by a 

fo SAO 18, and in flow diagram 20. the classfiles are 
passed through a Java Runtime System 22 toa JIT com- 
piler 24. The Java SAO 18 outputs Java classfiles 26 
containing bytecodes that perform the same operations 
as the input classfiles, only in an optimized manner. The 

IS JIT 24 outputs executable code 28 that can run directly 
on the native operating system 30. Those of ordinary 
skill in the art will recognize that the procedures de- 
scribed in FIG. 1 are merely illustrative and that there 
exist other structures within which classfiles may be op- 

20 timized. 

[0027] In processing an input bytecode source class- 
file, both an SAO and a JIT form an IR. As described 
above, an IR is a succinct and straightfonward structur- 
ing of the control and data flow information of a program 

25 into a CFG and DFG. The CFG represents the informa- 
tion regarding the sequence and order of the execution 
of the bytecode instructions as blocks of code linked in 
the order of execution of the program. The blocks of 
code are sections of bytecode that are always per- 

30 formed as an uninterrupted ga)up, and the links be- 
tween the blocks of code are bytecode branch, condi- 
tional, or jump instructions. 

[0028] FIG. 2 is a flow diagram that illustrates the gen- 
eration of the CFG from a bytecode classfile 40. It should 

35 be appreciated that there are several known methods 
for generating a CFG. The specifics of any such method 
will not be disclosed herein to avoid overcomplicating 
the disclosure and thereby obscuring the present inven- 
tion. The procedure begins with a classfile 40 containing 

40 an ordered sequence of bytecode instructions num- 
bered from 1 through N. The numbers 1 through N are 
analogous to the numbering of lines in a computer pro- 
gram as is known in the art Next, an optimizer 42 proc- 
esses the bytecode classfile 40 to extract the control 
flow information from the bytecodes. 
[0029] The control flow informalbn is illustrated by the 
CFG 44. For example, the CFG 44 may indicate that for 
classfile 40 that bytecode instructions 1 through 3 are 
first executed, instructions 100 through 103 are next ex- 

50 ecuted. either instructions 10 through 14 or 15 through 
19 are next executed, and finally instructions 4 through 
8 are executed before the remainder of the bytecode 
instructions execute. It will be appreciated that the CFG 
44 is an example only, and that any number of CFGs 

55 may be generated from the classfile 40 by the optimizer 
42. 

[0030] As described atx3ve, once the CFG has been 
generated, the DFG will be generated. According to the 
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present invention, two aspects of tine DFG are generat- 
ed from the CFG. In one aspect, the propagation of val- 
ues within blocks of bytecode is determined. As is 
known in a typical JAVA program, data values are pro- 
duced in one location and consumed in another. The first 
aspect of the present invention relates to establishing 
the connections between the production and consump- 
tion points that occur within uninterrupted blocks of 
code. The present invention allows implicit variables to 
be easily recognized and for their values to be immedi- 
ately linked in the DFG to the bytecode(s) using them. 
[0031] Unlike the prior art. the present invention 
avoids the approach taken in the prior art of constructing 
a stack to determine the values used by bytecodes. For 
example, in FIG. 3, a block diagram of a known method 
for generating links within a code block of a CFG is il- 
lustrated. The diagram demonstrates the principle stag- 
es employed to determine the value of implicit stack lo- 
cation variables when generating a DFG. During stage 
60, the operations of an input sequence of bytecodes 
62 are simulated to create a version of the stack 64 that 
they represent. The stack shown 64 presents only a por- 
tion 66 of the input sequence of bytecodes 62. During 
stage 68, the stack 64 is executed, and it is determined 
that the POP instruction 70 is associated with the value 
"2" due to the preceding BIPUSH "2" 72 instruction. This 
information is stored in a DFG node represented by ref- 
erence numerals 74 and 76. 

[0032] According to the preferred embodiment of the 
present inventbn, a node tree is employed. When val- 
ues are produced by the bytecode classfile, a new node 
is created in the tree. This node is then connected to the 
rest of the tree by linking it to the value's production and 
consumption points, as they become known. In this 
manner, the propagation of data values through a pro- 
gram can be readily traced. Those of ordinary skill in the 
art will recognize that a variety of data structures could 
be used. 

[0033] Turning now to FIG. 4, a flow diagram of a 
method for generating links within a bytecode block ac- 
cording to the present invention is illustrated. In a first 
step 100, a forward scan of a bytecode source file 102 
is performed in order to determine the starts 106, 108, 
and 1 1 0 of the bytecodes 1 1 2, 1 1 4, and 1 1 6, respective- 
ly, because as is known, bytecodes can be of different 
lengths. Next, at step 118 a backward scan of the byte- 
code source file 102 is performed. During the backward 
— scan step 118, the nodes of the DFG are generated as 
they occur. For example, if bytecode 116 is the instruc- 
tion POP, as POP is read, a node 120 is created in the 
DFG including a POP instruction 122. 
[0034] If next, the instruction BIPUSH "2" is read at 
bytecode 108, the value "2" at 124 is linked to the POP 
instruction 122 that consumes it. 
[0035] The present invention provides a significant in- 
crease in the efficiency in the generation of the DFG be- 
cause the entire process of constructing a copy of the 
stack has been eliminated. The invention recognizes 



that because the stream of bytecodes are the instruc- 
tions for a stack machine, the operands or values con- 
sumed by a bytecode instruction precede the instruc- 
tion. When the bytecode fil is read in reverse order, the 

5 step of creating a node in the DFG can be immediately 
performed after the bytecode Is read. 
[0036] Further examples, not intended to be exhaus- 
tive, are illustrated in FIGS. 5A and 5B of the nodes in 
a DFG created from bytecode instructions according to 

10 the present invention. FIGS. 5A and 5B respectively are 
examples of a single instruction with two links, and a 
series of instructions with multiple links. 
[0037] In FIG. 5A. a bytecode source block 150 has 
been first differentiated by a forward scan into byte- 

15 codes 1 52, 1 54 and 1 58. When the backward scan step 
is performed, an integer addition (lADD) instruction is 
read from bytecode 158, and a node 160 is created in 
the DFG. From the definition of the instruction lADD It 
is known that node 1 60 will consume two values. As the 

20 backward scan continues, these two values are read 
from bytecodes 1 54 and 1 52, created as nodes 162 and 
1 64 In the node tree of the DFG, and then linked to node 
160. 

[0038] In FIG. 5B, a bytecode source block 166 has 

25 been first differentiated by a forward scan Into byte- 
codes 168. 170 and 172. The sequence of bytecodes 
166 Implement the statement (X + X). In the backwards 
scan step the first bytecode 1 72 encountered is an I ADD 
instruction. As explained with respect to the example of 

30 FIG. 5A, the lADD instaiction generates a node 174 in 
the DFG with links to two vallSs. Since the next byte- 
code 170 is a duplicate (DUP) instruction both of the 
links from node 174 are made to the node 176 for the 
DUP instruction. Next, bytecode 168 having an integer 

35 load X (I LOADX) instruction is read, and a node 1 78 with 
a link to node 176 is created. In this sequence in the 
node tree, the value in node 178, will be duplicated in 
node 176. and both values will be consumed in node 
174. As will be appreciated by those of ordinary skill in 
the art from the above recited examples, when perform- 
ing the backwards scan step, the particu lar DFG created 
will depend upon the sequence of bytecodes in the 
blocks generated by the CFG. 
[0039] According to a second aspect of the present 

^#5 invention, the propagation of values between blocks of 
bylecode in the CFG is determined. FIG. 6 illustrates a 
CFG 180 having several blocks of linked code 200. 202. 
204. and 206. In the CFG 180. the blocks 202 and 206, 
that precede a join 208 are considered parallel. It should 

50 be appreciated that a join may connect a plurality of par- 
allel blocks. When the CFG is generated, stack states, 
210, 212. 214. and 216 are created for each block of 
bytecode 200, 202. 204, and 206. respectively The 
stack states includes all the values still present on the 

55 stack after the bytecodes in the block associated with 
that stack have been executed. It should be appreciated 
that values on the stack are produced by instructions in 
the bytecode blocks. This is shown by the mapping of 
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the instructions 218 and 220 to the stack states 222 and 
224, respectively. For purposes of illustration to be de- 
scribed below, instruction 218 is PUSH 'T, instruction 
220 is PUSH "0", and instruction 226 is POP. 
[0040] To determine when a value produced in an ear- 
ly block of bytecode is used or consumed by a subse- 
quent block of bytecode to generate a link In the DFG 
between blocks of bytecode, an execution of the blocks 
is made while noting the consumption of a prbr stack 
state value and identifying its source. This information 
is represented in the DFG as a link between the nodes 
of the corisumption and production of the bytecodes. To 
establish links for all parallel blocks, the method in the 
prior art relied upon iterating through all possible control 
flow paths in the CFG. For example, to generate the 
DFG for links between blocks in the CFG 180, a first 
pass would be made through the path including blocks 
200. 202 and 204, and then a second pass would be 
made through the path including blocks 200, 206 and 
204. 

[0041] According to the present invention, not all con- 
trol flow paths must be traversed to form the DFG for 
links between parallel blocks of bytecode. Rather, the 
links in the DFG between blocks 200, 202, 204 and 206 
can be generated simply by stepping through blocks 
200, 202, 204 and then applying the information from 
the pass through blocks 200, 202. 204 to establish the 
lin ks to block 206. The present inventbn exploits the fact 
that Java standards' require and the verification de- 
scribed above ensures that in properly formatted code, 
the stack states for any of the blocks preceding a join 
are the same. 

[0042] For example, in CFG 180, both of the stack 
states 212 and 216 that precede the join 208 must be 
the same in both size and relative type. During execu- 
tion, the consumption of any value in block 202 will be 
paralleled by a consumption of a value in block 206. As 
a result, for example, if the value corresponding to the 
stack state location 222 is used by the bytecode instruc- 
tion in 226 of code block 204, then the bytecode instruc- 
tion 226 also uses the value corresponding to the stack 
state location 224. 

[0043] The following lines of pseudocode illustrate the 
point. 



IF (some condition occurs) 



X = 0; 

10 



IS 

A = X + 2; 

[0044] The assignment of a value to "A" in the expres- 
20 slon "A = X + 2" uses the value from either expression 
"X = 1 • or X = 0" that has been assigned to the implicit 
stack variable "X" as determined by the "IF (some con- 
dition occurs)" statement. 

[0045] In FIG. 7A. a block diagram outlining the steps 

25 employed to generate a DFG according to the second 
aspect of the present invention is illustrated. 
[0046] At step 250, a first branch of a CFG graph is 
scanned. Applied to CFG 180 in FIG. 6, a first pass 
through blocks 200, 202 and 204 is performed. 

30 [0047] Next at step 252. linl^between the place a val- 
ue is created and the place a value is consumed are 
made. As illustrated in FIG.7B, a link 254 is made be- 
tween the PUSH "I" instruction at location 218 in the 
CFG 1 80 that the POP instructran consumes at location 

55 226 in the CFG 180. 

[0048] Next at step 256. a link 258 is made between 
the PUSH 0 instruction at location 220 that the POP in- 
struction consumes at location 226, because it is known 
that the data for this instruction occupies the same re- 
spective stack state location as the PUSH 1 instruction 
at location 21 8 in the CFG 1 80. As such, without travers- 
ing all paths in the CFG as relied upon by the prior art, 
according to the second aspect of the present invention, 
the links in a DFG between blocks of bytecodes can be 

^^5 more efficiently made. As joins are quite common 
throughout bytecode source code, the present invention 
represents a significant increase in efficiency Where 
multiple joins exist, the increase in efficiency is even 
greater. 

so [0049] While illustrative embodiments and applica- 
tions of this invention have been shown and described, 
it would be apparent to those skilled in the art that many 
more modifications than have been mentioned above 
are possible without departing from the inventive con- 

55 cepts set forth herein. The invention, therefore, is not to 
be limited except in the scope of the appended claims. 
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Claims 

1. A method for linking bytecodes of an uninterrupted 
block of bytecodes in the formafton of a data flow 
graph comprising the steps of: s 

scanning said uninterrupted block of bytecodes 
in a forward manner to identify the start of each 
of said bytecodes; 

scanning in a backward manner bytecodewise io 

each of said bytecodes in said uninterrupted 

block of bytecodes; and 

gisnerating a link in said data flow graph that 

links each of said bytecodes to ail other of said 

bytecodes used by said each of said byte- is 

codes. 

2. A method for linking bytecodes between uninter- 
rupted blocks of bytecodes in the formation of a data 
flow graph, said uninterrupted blocks of bytecodes 20 
having links according to an order of execution of 
said uninterrupted blocks and wherein a stack, state 
has been generated for each of said uninterrupted 
blocks of bytecodes, comprising the steps of: 

25 

stepping through a first path of a plurality of 
paths of said order of execution that terminates 
in a join to generate a link in said data flow 
graph between each bytecode producing a val- 
ue in one of said uninterrupted blocks and each 30 
bytecode consuming said value in another of 
said uninterrupted blocks in said first path; and 
duplicating each link in said first path with a link 
for each bytecode in all of said plurality of paths 
other than said first path for each bytecode pro- 35 
ducing a value having a similar stack location 
to each bytecode producing a value in one of 
said uninterrupted blocks in said first path. 

3. A computer readable storage medium having re- 40 
corded thereon computer program code compo- 
nents that, when loaded onto a computer and exe- 
cuted, cause that computer to operate according to 
the method of claim 1 or claim 2. 
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